Overview

Brought to you by YData

Dataset statistics

Number of variables18
Number of observations36275
Missing cells0
Missing cells (%)0.0%
Duplicate rows3138
Duplicate rows (%)8.7%
Total size in memory5.0 MiB
Average record size in memory144.0 B

Variable types

Categorical8
Numeric10

Alerts

Dataset has 3138 (8.7%) duplicate rowsDuplicates
no_of_previous_bookings_not_canceled is highly overall correlated with repeated_guestHigh correlation
repeated_guest is highly overall correlated with no_of_previous_bookings_not_canceledHigh correlation
no_of_adults is highly imbalanced (52.4%) Imbalance
required_car_parking_space is highly imbalanced (80.1%) Imbalance
room_type_reserved is highly imbalanced (62.5%) Imbalance
repeated_guest is highly imbalanced (82.8%) Imbalance
no_of_previous_cancellations is highly skewed (γ1 = 25.19987595) Skewed
no_of_children has 33577 (92.6%) zeros Zeros
no_of_weekend_nights has 16872 (46.5%) zeros Zeros
no_of_week_nights has 2387 (6.6%) zeros Zeros
lead_time has 1297 (3.6%) zeros Zeros
no_of_previous_cancellations has 35937 (99.1%) zeros Zeros
no_of_previous_bookings_not_canceled has 35463 (97.8%) zeros Zeros
avg_price_per_room has 545 (1.5%) zeros Zeros
no_of_special_requests has 19777 (54.5%) zeros Zeros

Reproduction

Analysis started2025-03-20 05:34:26.060543
Analysis finished2025-03-20 05:34:31.944574
Duration5.88 seconds
Software versionydata-profiling vv4.14.0
Download configurationconfig.json

Variables

no_of_adults
Categorical

Imbalance 

Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size283.5 KiB
2
26108 
1
7695 
3
 
2317
0
 
139
4
 
16

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters36275
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row2
3rd row1
4th row2
5th row2

Common Values

ValueCountFrequency (%)
2 26108
72.0%
1 7695
 
21.2%
3 2317
 
6.4%
0 139
 
0.4%
4 16
 
< 0.1%

Length

2025-03-20T11:04:31.980765image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-03-20T11:04:32.014430image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
2 26108
72.0%
1 7695
 
21.2%
3 2317
 
6.4%
0 139
 
0.4%
4 16
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
2 26108
72.0%
1 7695
 
21.2%
3 2317
 
6.4%
0 139
 
0.4%
4 16
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown) 36275
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
2 26108
72.0%
1 7695
 
21.2%
3 2317
 
6.4%
0 139
 
0.4%
4 16
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 36275
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
2 26108
72.0%
1 7695
 
21.2%
3 2317
 
6.4%
0 139
 
0.4%
4 16
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 36275
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
2 26108
72.0%
1 7695
 
21.2%
3 2317
 
6.4%
0 139
 
0.4%
4 16
 
< 0.1%

no_of_children
Real number (ℝ)

Zeros 

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.10527912
Minimum0
Maximum10
Zeros33577
Zeros (%)92.6%
Negative0
Negative (%)0.0%
Memory size283.5 KiB
2025-03-20T11:04:32.045919image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum10
Range10
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.40264806
Coefficient of variation (CV)3.8245767
Kurtosis36.981856
Mean0.10527912
Median Absolute Deviation (MAD)0
Skewness4.7103495
Sum3819
Variance0.16212546
MonotonicityNot monotonic
2025-03-20T11:04:32.079594image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
0 33577
92.6%
1 1618
 
4.5%
2 1058
 
2.9%
3 19
 
0.1%
9 2
 
< 0.1%
10 1
 
< 0.1%
ValueCountFrequency (%)
0 33577
92.6%
1 1618
 
4.5%
2 1058
 
2.9%
3 19
 
0.1%
9 2
 
< 0.1%
10 1
 
< 0.1%
ValueCountFrequency (%)
10 1
 
< 0.1%
9 2
 
< 0.1%
3 19
 
0.1%
2 1058
 
2.9%
1 1618
 
4.5%
0 33577
92.6%

no_of_weekend_nights
Real number (ℝ)

Zeros 

Distinct8
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.81072364
Minimum0
Maximum7
Zeros16872
Zeros (%)46.5%
Negative0
Negative (%)0.0%
Memory size283.5 KiB
2025-03-20T11:04:32.110152image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q32
95-th percentile2
Maximum7
Range7
Interquartile range (IQR)2

Descriptive statistics

Standard deviation0.87064361
Coefficient of variation (CV)1.0739092
Kurtosis0.29885756
Mean0.81072364
Median Absolute Deviation (MAD)1
Skewness0.73761596
Sum29409
Variance0.7580203
MonotonicityNot monotonic
2025-03-20T11:04:32.143905image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
0 16872
46.5%
1 9995
27.6%
2 9071
25.0%
3 153
 
0.4%
4 129
 
0.4%
5 34
 
0.1%
6 20
 
0.1%
7 1
 
< 0.1%
ValueCountFrequency (%)
0 16872
46.5%
1 9995
27.6%
2 9071
25.0%
3 153
 
0.4%
4 129
 
0.4%
5 34
 
0.1%
6 20
 
0.1%
7 1
 
< 0.1%
ValueCountFrequency (%)
7 1
 
< 0.1%
6 20
 
0.1%
5 34
 
0.1%
4 129
 
0.4%
3 153
 
0.4%
2 9071
25.0%
1 9995
27.6%
0 16872
46.5%

no_of_week_nights
Real number (ℝ)

Zeros 

Distinct18
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.2043005
Minimum0
Maximum17
Zeros2387
Zeros (%)6.6%
Negative0
Negative (%)0.0%
Memory size283.5 KiB
2025-03-20T11:04:32.178974image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median2
Q33
95-th percentile5
Maximum17
Range17
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.4109049
Coefficient of variation (CV)0.6400692
Kurtosis7.7982839
Mean2.2043005
Median Absolute Deviation (MAD)1
Skewness1.5993504
Sum79961
Variance1.9906525
MonotonicityNot monotonic
2025-03-20T11:04:32.216944image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
2 11444
31.5%
1 9488
26.2%
3 7839
21.6%
4 2990
 
8.2%
0 2387
 
6.6%
5 1614
 
4.4%
6 189
 
0.5%
7 113
 
0.3%
10 62
 
0.2%
8 62
 
0.2%
Other values (8) 87
 
0.2%
ValueCountFrequency (%)
0 2387
 
6.6%
1 9488
26.2%
2 11444
31.5%
3 7839
21.6%
4 2990
 
8.2%
5 1614
 
4.4%
6 189
 
0.5%
7 113
 
0.3%
8 62
 
0.2%
9 34
 
0.1%
ValueCountFrequency (%)
17 3
 
< 0.1%
16 2
 
< 0.1%
15 10
 
< 0.1%
14 7
 
< 0.1%
13 5
 
< 0.1%
12 9
 
< 0.1%
11 17
 
< 0.1%
10 62
0.2%
9 34
0.1%
8 62
0.2%
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size283.5 KiB
Meal Plan 1
27835 
Not Selected
5130 
Meal Plan 2
3305 
Meal Plan 3
 
5

Length

Max length12
Median length11
Mean length11.14142
Min length11

Characters and Unicode

Total characters404155
Distinct characters16
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowMeal Plan 1
2nd rowNot Selected
3rd rowMeal Plan 1
4th rowMeal Plan 1
5th rowNot Selected

Common Values

ValueCountFrequency (%)
Meal Plan 1 27835
76.7%
Not Selected 5130
 
14.1%
Meal Plan 2 3305
 
9.1%
Meal Plan 3 5
 
< 0.1%

Length

2025-03-20T11:04:32.259565image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-03-20T11:04:32.287921image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
meal 31145
30.0%
plan 31145
30.0%
1 27835
26.8%
not 5130
 
4.9%
selected 5130
 
4.9%
2 3305
 
3.2%
3 5
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
l 67420
16.7%
67420
16.7%
a 62290
15.4%
e 46535
11.5%
M 31145
7.7%
P 31145
7.7%
n 31145
7.7%
1 27835
6.9%
t 10260
 
2.5%
N 5130
 
1.3%
Other values (6) 23830
 
5.9%

Most occurring categories

ValueCountFrequency (%)
(unknown) 404155
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
l 67420
16.7%
67420
16.7%
a 62290
15.4%
e 46535
11.5%
M 31145
7.7%
P 31145
7.7%
n 31145
7.7%
1 27835
6.9%
t 10260
 
2.5%
N 5130
 
1.3%
Other values (6) 23830
 
5.9%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 404155
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
l 67420
16.7%
67420
16.7%
a 62290
15.4%
e 46535
11.5%
M 31145
7.7%
P 31145
7.7%
n 31145
7.7%
1 27835
6.9%
t 10260
 
2.5%
N 5130
 
1.3%
Other values (6) 23830
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 404155
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
l 67420
16.7%
67420
16.7%
a 62290
15.4%
e 46535
11.5%
M 31145
7.7%
P 31145
7.7%
n 31145
7.7%
1 27835
6.9%
t 10260
 
2.5%
N 5130
 
1.3%
Other values (6) 23830
 
5.9%

required_car_parking_space
Categorical

Imbalance 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size283.5 KiB
0
35151 
1
 
1124

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters36275
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 35151
96.9%
1 1124
 
3.1%

Length

2025-03-20T11:04:32.325585image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-03-20T11:04:32.349717image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
0 35151
96.9%
1 1124
 
3.1%

Most occurring characters

ValueCountFrequency (%)
0 35151
96.9%
1 1124
 
3.1%

Most occurring categories

ValueCountFrequency (%)
(unknown) 36275
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
0 35151
96.9%
1 1124
 
3.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 36275
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
0 35151
96.9%
1 1124
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 36275
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
0 35151
96.9%
1 1124
 
3.1%

room_type_reserved
Categorical

Imbalance 

Distinct7
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size283.5 KiB
Room_Type 1
28130 
Room_Type 4
6057 
Room_Type 6
 
966
Room_Type 2
 
692
Room_Type 5
 
265
Other values (2)
 
165

Length

Max length11
Median length11
Mean length11
Min length11

Characters and Unicode

Total characters399025
Distinct characters16
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowRoom_Type 1
2nd rowRoom_Type 1
3rd rowRoom_Type 1
4th rowRoom_Type 1
5th rowRoom_Type 1

Common Values

ValueCountFrequency (%)
Room_Type 1 28130
77.5%
Room_Type 4 6057
 
16.7%
Room_Type 6 966
 
2.7%
Room_Type 2 692
 
1.9%
Room_Type 5 265
 
0.7%
Room_Type 7 158
 
0.4%
Room_Type 3 7
 
< 0.1%

Length

2025-03-20T11:04:32.379690image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-03-20T11:04:32.412868image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
room_type 36275
50.0%
1 28130
38.8%
4 6057
 
8.3%
6 966
 
1.3%
2 692
 
1.0%
5 265
 
0.4%
7 158
 
0.2%
3 7
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
o 72550
18.2%
R 36275
9.1%
m 36275
9.1%
_ 36275
9.1%
T 36275
9.1%
y 36275
9.1%
p 36275
9.1%
e 36275
9.1%
36275
9.1%
1 28130
 
7.0%
Other values (6) 8145
 
2.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 399025
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
o 72550
18.2%
R 36275
9.1%
m 36275
9.1%
_ 36275
9.1%
T 36275
9.1%
y 36275
9.1%
p 36275
9.1%
e 36275
9.1%
36275
9.1%
1 28130
 
7.0%
Other values (6) 8145
 
2.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 399025
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
o 72550
18.2%
R 36275
9.1%
m 36275
9.1%
_ 36275
9.1%
T 36275
9.1%
y 36275
9.1%
p 36275
9.1%
e 36275
9.1%
36275
9.1%
1 28130
 
7.0%
Other values (6) 8145
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 399025
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
o 72550
18.2%
R 36275
9.1%
m 36275
9.1%
_ 36275
9.1%
T 36275
9.1%
y 36275
9.1%
p 36275
9.1%
e 36275
9.1%
36275
9.1%
1 28130
 
7.0%
Other values (6) 8145
 
2.0%

lead_time
Real number (ℝ)

Zeros 

Distinct352
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean85.232557
Minimum0
Maximum443
Zeros1297
Zeros (%)3.6%
Negative0
Negative (%)0.0%
Memory size283.5 KiB
2025-03-20T11:04:32.461604image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q117
median57
Q3126
95-th percentile273
Maximum443
Range443
Interquartile range (IQR)109

Descriptive statistics

Standard deviation85.930817
Coefficient of variation (CV)1.0081924
Kurtosis1.1795941
Mean85.232557
Median Absolute Deviation (MAD)47
Skewness1.2924915
Sum3091811
Variance7384.1053
MonotonicityNot monotonic
2025-03-20T11:04:32.512547image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 1297
 
3.6%
1 1078
 
3.0%
2 643
 
1.8%
3 630
 
1.7%
4 628
 
1.7%
5 577
 
1.6%
6 519
 
1.4%
8 436
 
1.2%
7 429
 
1.2%
12 412
 
1.1%
Other values (342) 29626
81.7%
ValueCountFrequency (%)
0 1297
3.6%
1 1078
3.0%
2 643
1.8%
3 630
1.7%
4 628
1.7%
5 577
1.6%
6 519
1.4%
7 429
 
1.2%
8 436
 
1.2%
9 332
 
0.9%
ValueCountFrequency (%)
443 22
 
0.1%
433 20
 
0.1%
418 60
0.2%
386 69
0.2%
381 2
 
< 0.1%
377 69
0.2%
372 1
 
< 0.1%
361 5
 
< 0.1%
359 16
 
< 0.1%
355 1
 
< 0.1%

arrival_year
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size283.5 KiB
2018
29761 
2017
6514 

Length

Max length4
Median length4
Mean length4
Min length4

Characters and Unicode

Total characters145100
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2017
2nd row2018
3rd row2018
4th row2018
5th row2018

Common Values

ValueCountFrequency (%)
2018 29761
82.0%
2017 6514
 
18.0%

Length

2025-03-20T11:04:32.557305image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-03-20T11:04:32.583187image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
2018 29761
82.0%
2017 6514
 
18.0%

Most occurring characters

ValueCountFrequency (%)
2 36275
25.0%
0 36275
25.0%
1 36275
25.0%
8 29761
20.5%
7 6514
 
4.5%

Most occurring categories

ValueCountFrequency (%)
(unknown) 145100
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
2 36275
25.0%
0 36275
25.0%
1 36275
25.0%
8 29761
20.5%
7 6514
 
4.5%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 145100
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
2 36275
25.0%
0 36275
25.0%
1 36275
25.0%
8 29761
20.5%
7 6514
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 145100
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
2 36275
25.0%
0 36275
25.0%
1 36275
25.0%
8 29761
20.5%
7 6514
 
4.5%

arrival_month
Real number (ℝ)

Distinct12
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.4236527
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size283.5 KiB
2025-03-20T11:04:32.608704image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q15
median8
Q310
95-th percentile12
Maximum12
Range11
Interquartile range (IQR)5

Descriptive statistics

Standard deviation3.0698944
Coefficient of variation (CV)0.41352883
Kurtosis-0.93318896
Mean7.4236527
Median Absolute Deviation (MAD)2
Skewness-0.34822885
Sum269293
Variance9.4242517
MonotonicityNot monotonic
2025-03-20T11:04:32.643012image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
10 5317
14.7%
9 4611
12.7%
8 3813
10.5%
6 3203
8.8%
12 3021
8.3%
11 2980
8.2%
7 2920
8.0%
4 2736
7.5%
5 2598
7.2%
3 2358
6.5%
Other values (2) 2718
7.5%
ValueCountFrequency (%)
1 1014
 
2.8%
2 1704
 
4.7%
3 2358
6.5%
4 2736
7.5%
5 2598
7.2%
6 3203
8.8%
7 2920
8.0%
8 3813
10.5%
9 4611
12.7%
10 5317
14.7%
ValueCountFrequency (%)
12 3021
8.3%
11 2980
8.2%
10 5317
14.7%
9 4611
12.7%
8 3813
10.5%
7 2920
8.0%
6 3203
8.8%
5 2598
7.2%
4 2736
7.5%
3 2358
6.5%

arrival_date
Real number (ℝ)

Distinct31
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.596995
Minimum1
Maximum31
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size283.5 KiB
2025-03-20T11:04:32.680299image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q18
median16
Q323
95-th percentile29
Maximum31
Range30
Interquartile range (IQR)15

Descriptive statistics

Standard deviation8.7404474
Coefficient of variation (CV)0.56039303
Kurtosis-1.157214
Mean15.596995
Median Absolute Deviation (MAD)8
Skewness0.028808569
Sum565781
Variance76.39542
MonotonicityNot monotonic
2025-03-20T11:04:32.724837image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=31)
ValueCountFrequency (%)
13 1358
 
3.7%
17 1345
 
3.7%
2 1331
 
3.7%
4 1327
 
3.7%
19 1327
 
3.7%
16 1306
 
3.6%
20 1281
 
3.5%
15 1273
 
3.5%
6 1273
 
3.5%
18 1260
 
3.5%
Other values (21) 23194
63.9%
ValueCountFrequency (%)
1 1133
3.1%
2 1331
3.7%
3 1098
3.0%
4 1327
3.7%
5 1154
3.2%
6 1273
3.5%
7 1110
3.1%
8 1198
3.3%
9 1130
3.1%
10 1089
3.0%
ValueCountFrequency (%)
31 578
1.6%
30 1216
3.4%
29 1190
3.3%
28 1129
3.1%
27 1059
2.9%
26 1146
3.2%
25 1146
3.2%
24 1103
3.0%
23 990
2.7%
22 1023
2.8%
Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size283.5 KiB
Online
23214 
Offline
10528 
Corporate
 
2017
Complementary
 
391
Aviation
 
125

Length

Max length13
Median length6
Mean length6.5393797
Min length6

Characters and Unicode

Total characters237216
Distinct characters16
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowOffline
2nd rowOnline
3rd rowOnline
4th rowOnline
5th rowOnline

Common Values

ValueCountFrequency (%)
Online 23214
64.0%
Offline 10528
29.0%
Corporate 2017
 
5.6%
Complementary 391
 
1.1%
Aviation 125
 
0.3%

Length

2025-03-20T11:04:32.772997image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-03-20T11:04:32.803807image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
online 23214
64.0%
offline 10528
29.0%
corporate 2017
 
5.6%
complementary 391
 
1.1%
aviation 125
 
0.3%

Most occurring characters

ValueCountFrequency (%)
n 57472
24.2%
e 36541
15.4%
l 34133
14.4%
i 33992
14.3%
O 33742
14.2%
f 21056
 
8.9%
o 4550
 
1.9%
r 4425
 
1.9%
a 2533
 
1.1%
t 2533
 
1.1%
Other values (6) 6239
 
2.6%

Most occurring categories

ValueCountFrequency (%)
(unknown) 237216
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
n 57472
24.2%
e 36541
15.4%
l 34133
14.4%
i 33992
14.3%
O 33742
14.2%
f 21056
 
8.9%
o 4550
 
1.9%
r 4425
 
1.9%
a 2533
 
1.1%
t 2533
 
1.1%
Other values (6) 6239
 
2.6%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 237216
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
n 57472
24.2%
e 36541
15.4%
l 34133
14.4%
i 33992
14.3%
O 33742
14.2%
f 21056
 
8.9%
o 4550
 
1.9%
r 4425
 
1.9%
a 2533
 
1.1%
t 2533
 
1.1%
Other values (6) 6239
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 237216
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
n 57472
24.2%
e 36541
15.4%
l 34133
14.4%
i 33992
14.3%
O 33742
14.2%
f 21056
 
8.9%
o 4550
 
1.9%
r 4425
 
1.9%
a 2533
 
1.1%
t 2533
 
1.1%
Other values (6) 6239
 
2.6%

repeated_guest
Categorical

High correlation  Imbalance 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size283.5 KiB
0
35345 
1
 
930

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters36275
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 35345
97.4%
1 930
 
2.6%

Length

2025-03-20T11:04:32.843704image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-03-20T11:04:32.867725image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
0 35345
97.4%
1 930
 
2.6%

Most occurring characters

ValueCountFrequency (%)
0 35345
97.4%
1 930
 
2.6%

Most occurring categories

ValueCountFrequency (%)
(unknown) 36275
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
0 35345
97.4%
1 930
 
2.6%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 36275
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
0 35345
97.4%
1 930
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 36275
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
0 35345
97.4%
1 930
 
2.6%

no_of_previous_cancellations
Real number (ℝ)

Skewed  Zeros 

Distinct9
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.023349414
Minimum0
Maximum13
Zeros35937
Zeros (%)99.1%
Negative0
Negative (%)0.0%
Memory size283.5 KiB
2025-03-20T11:04:32.892030image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum13
Range13
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.36833145
Coefficient of variation (CV)15.774762
Kurtosis732.73568
Mean0.023349414
Median Absolute Deviation (MAD)0
Skewness25.199876
Sum847
Variance0.13566806
MonotonicityNot monotonic
2025-03-20T11:04:32.927652image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
0 35937
99.1%
1 198
 
0.5%
2 46
 
0.1%
3 43
 
0.1%
11 25
 
0.1%
5 11
 
< 0.1%
4 10
 
< 0.1%
13 4
 
< 0.1%
6 1
 
< 0.1%
ValueCountFrequency (%)
0 35937
99.1%
1 198
 
0.5%
2 46
 
0.1%
3 43
 
0.1%
4 10
 
< 0.1%
5 11
 
< 0.1%
6 1
 
< 0.1%
11 25
 
0.1%
13 4
 
< 0.1%
ValueCountFrequency (%)
13 4
 
< 0.1%
11 25
 
0.1%
6 1
 
< 0.1%
5 11
 
< 0.1%
4 10
 
< 0.1%
3 43
 
0.1%
2 46
 
0.1%
1 198
 
0.5%
0 35937
99.1%

no_of_previous_bookings_not_canceled
Real number (ℝ)

High correlation  Zeros 

Distinct59
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.15341144
Minimum0
Maximum58
Zeros35463
Zeros (%)97.8%
Negative0
Negative (%)0.0%
Memory size283.5 KiB
2025-03-20T11:04:32.971994image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum58
Range58
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1.7541707
Coefficient of variation (CV)11.434419
Kurtosis457.38009
Mean0.15341144
Median Absolute Deviation (MAD)0
Skewness19.250191
Sum5565
Variance3.0771149
MonotonicityNot monotonic
2025-03-20T11:04:33.025658image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 35463
97.8%
1 228
 
0.6%
2 112
 
0.3%
3 80
 
0.2%
4 65
 
0.2%
5 60
 
0.2%
6 36
 
0.1%
7 24
 
0.1%
8 23
 
0.1%
10 19
 
0.1%
Other values (49) 165
 
0.5%
ValueCountFrequency (%)
0 35463
97.8%
1 228
 
0.6%
2 112
 
0.3%
3 80
 
0.2%
4 65
 
0.2%
5 60
 
0.2%
6 36
 
0.1%
7 24
 
0.1%
8 23
 
0.1%
9 19
 
0.1%
ValueCountFrequency (%)
58 1
< 0.1%
57 1
< 0.1%
56 1
< 0.1%
55 1
< 0.1%
54 1
< 0.1%
53 1
< 0.1%
52 1
< 0.1%
51 1
< 0.1%
50 1
< 0.1%
49 1
< 0.1%

avg_price_per_room
Real number (ℝ)

Zeros 

Distinct3930
Distinct (%)10.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean103.42354
Minimum0
Maximum540
Zeros545
Zeros (%)1.5%
Negative0
Negative (%)0.0%
Memory size283.5 KiB
2025-03-20T11:04:33.077710image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile61
Q180.3
median99.45
Q3120
95-th percentile165
Maximum540
Range540
Interquartile range (IQR)39.7

Descriptive statistics

Standard deviation35.089424
Coefficient of variation (CV)0.33927889
Kurtosis3.154125
Mean103.42354
Median Absolute Deviation (MAD)20.25
Skewness0.66713287
Sum3751688.9
Variance1231.2677
MonotonicityNot monotonic
2025-03-20T11:04:33.241706image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
65 848
 
2.3%
75 826
 
2.3%
90 703
 
1.9%
95 669
 
1.8%
115 662
 
1.8%
120 612
 
1.7%
100 604
 
1.7%
110 560
 
1.5%
0 545
 
1.5%
85 506
 
1.4%
Other values (3920) 29740
82.0%
ValueCountFrequency (%)
0 545
1.5%
0.5 1
 
< 0.1%
1 9
 
< 0.1%
1.48 1
 
< 0.1%
1.6 1
 
< 0.1%
2 6
 
< 0.1%
3 3
 
< 0.1%
4.5 1
 
< 0.1%
6 25
 
0.1%
6.5 1
 
< 0.1%
ValueCountFrequency (%)
540 1
 
< 0.1%
375.5 1
 
< 0.1%
365 1
 
< 0.1%
349.63 1
 
< 0.1%
332.57 1
 
< 0.1%
316 1
 
< 0.1%
314.1 1
 
< 0.1%
306 2
 
< 0.1%
300 5
< 0.1%
299.33 1
 
< 0.1%

no_of_special_requests
Real number (ℝ)

Zeros 

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.61965541
Minimum0
Maximum5
Zeros19777
Zeros (%)54.5%
Negative0
Negative (%)0.0%
Memory size283.5 KiB
2025-03-20T11:04:33.280446image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile2
Maximum5
Range5
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.7862359
Coefficient of variation (CV)1.2688276
Kurtosis0.88143702
Mean0.61965541
Median Absolute Deviation (MAD)0
Skewness1.1450808
Sum22478
Variance0.61816689
MonotonicityNot monotonic
2025-03-20T11:04:33.311758image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
0 19777
54.5%
1 11373
31.4%
2 4364
 
12.0%
3 675
 
1.9%
4 78
 
0.2%
5 8
 
< 0.1%
ValueCountFrequency (%)
0 19777
54.5%
1 11373
31.4%
2 4364
 
12.0%
3 675
 
1.9%
4 78
 
0.2%
5 8
 
< 0.1%
ValueCountFrequency (%)
5 8
 
< 0.1%
4 78
 
0.2%
3 675
 
1.9%
2 4364
 
12.0%
1 11373
31.4%
0 19777
54.5%

booking_status
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size283.5 KiB
Not_Canceled
24390 
Canceled
11885 

Length

Max length12
Median length12
Mean length10.689456
Min length8

Characters and Unicode

Total characters387760
Distinct characters11
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNot_Canceled
2nd rowNot_Canceled
3rd rowCanceled
4th rowCanceled
5th rowCanceled

Common Values

ValueCountFrequency (%)
Not_Canceled 24390
67.2%
Canceled 11885
32.8%

Length

2025-03-20T11:04:33.350610image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-03-20T11:04:33.377739image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
not_canceled 24390
67.2%
canceled 11885
32.8%

Most occurring characters

ValueCountFrequency (%)
e 72550
18.7%
C 36275
9.4%
a 36275
9.4%
n 36275
9.4%
c 36275
9.4%
l 36275
9.4%
d 36275
9.4%
N 24390
 
6.3%
o 24390
 
6.3%
t 24390
 
6.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 387760
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e 72550
18.7%
C 36275
9.4%
a 36275
9.4%
n 36275
9.4%
c 36275
9.4%
l 36275
9.4%
d 36275
9.4%
N 24390
 
6.3%
o 24390
 
6.3%
t 24390
 
6.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 387760
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e 72550
18.7%
C 36275
9.4%
a 36275
9.4%
n 36275
9.4%
c 36275
9.4%
l 36275
9.4%
d 36275
9.4%
N 24390
 
6.3%
o 24390
 
6.3%
t 24390
 
6.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 387760
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e 72550
18.7%
C 36275
9.4%
a 36275
9.4%
n 36275
9.4%
c 36275
9.4%
l 36275
9.4%
d 36275
9.4%
N 24390
 
6.3%
o 24390
 
6.3%
t 24390
 
6.3%

Interactions

2025-03-20T11:04:31.205594image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:27.069493image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:27.578067image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:28.004700image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:28.437380image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:28.900163image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:29.339215image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:29.847007image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:30.302589image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:30.791332image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:31.245201image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:27.112729image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:27.619268image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:28.047414image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:28.541573image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:28.939932image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:29.380475image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:29.893319image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:30.344623image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:30.833913image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:31.286885image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:27.155269image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:27.662599image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:28.091363image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:28.582602image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:28.989366image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:29.424665image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:29.941308image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:30.423911image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:30.876457image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:31.330415image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:27.199112image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:27.707910image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:28.135542image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:28.624017image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:29.056955image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:29.469788image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:29.991360image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:30.473191image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:30.920064image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:31.368767image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:27.237925image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:27.748999image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:28.177532image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:28.661375image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:29.095856image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:29.510553image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:30.036323image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:30.517308image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:30.960133image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:31.500345image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:27.278235image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:27.791218image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:28.220454image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:28.699752image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:29.135127image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:29.551810image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:30.081325image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:30.565664image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:31.000075image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:31.542132image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:27.318680image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:27.833430image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:28.263800image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:28.739936image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:29.175423image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:29.593430image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:30.130127image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:30.611844image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:31.041362image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:31.584151image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:27.364245image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:27.876906image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:28.307798image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:28.779534image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:29.216612image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:29.635718image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:30.173897image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:30.657252image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:31.082532image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:31.626014image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:27.436469image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:27.921328image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:28.351137image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:28.820672image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:29.258931image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:29.679434image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:30.218087image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:30.702972image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:31.125886image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:31.666888image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:27.537793image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:27.963497image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:28.394746image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:28.860368image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:29.298872image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:29.720535image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:30.260476image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:30.747389image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-03-20T11:04:31.164855image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Correlations

2025-03-20T11:04:33.412336image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
arrival_datearrival_montharrival_yearavg_price_per_roombooking_statuslead_timemarket_segment_typeno_of_adultsno_of_childrenno_of_previous_bookings_not_canceledno_of_previous_cancellationsno_of_special_requestsno_of_week_nightsno_of_weekend_nightsrepeated_guestrequired_car_parking_spaceroom_type_reservedtype_of_meal_plan
arrival_date1.000-0.0430.0860.0070.0350.0000.0470.0360.029-0.006-0.0180.020-0.0100.0290.0330.0070.0250.073
arrival_month-0.0431.0000.3950.0160.1720.0810.1040.095-0.009-0.0030.0110.0900.045-0.0100.0750.0680.0450.099
arrival_year0.0860.3951.0000.1720.1790.1470.1890.1020.0280.0210.0220.0950.0300.0720.0170.0150.1130.196
avg_price_per_room0.0070.0160.1721.0000.165-0.0210.3170.1610.244-0.178-0.1030.1980.018-0.0260.1610.0640.2780.104
booking_status0.0350.1720.1790.1651.0000.4380.1490.0960.0370.0570.0430.2580.1060.0770.1070.0860.0380.087
lead_time0.0000.0810.147-0.0210.4381.0000.1760.098-0.026-0.191-0.101-0.0810.2450.0990.1640.0690.0670.173
market_segment_type0.0470.1040.1890.3170.1490.1761.0000.1990.0620.1560.1060.2080.1150.1190.4690.1260.1650.229
no_of_adults0.0360.0950.1020.1610.0960.0980.1991.0000.1810.0690.0430.1120.0750.0680.2240.0180.3290.090
no_of_children0.029-0.0090.0280.2440.037-0.0260.0620.1811.000-0.034-0.0260.1350.0190.0310.0250.0320.4060.037
no_of_previous_bookings_not_canceled-0.006-0.0030.021-0.1780.057-0.1910.1560.069-0.0341.0000.4170.001-0.123-0.0660.5310.0670.0340.020
no_of_previous_cancellations-0.0180.0110.022-0.1030.043-0.1010.1060.043-0.0260.4171.000-0.024-0.045-0.0320.3840.0330.0380.015
no_of_special_requests0.0200.0900.0950.1980.258-0.0810.2080.1120.1350.001-0.0241.0000.0450.0660.0390.0950.0750.070
no_of_week_nights-0.0100.0450.0300.0180.1060.2450.1150.0750.019-0.123-0.0450.0451.0000.0180.1220.0580.0450.065
no_of_weekend_nights0.029-0.0100.072-0.0260.0770.0990.1190.0680.031-0.066-0.0320.0660.0181.0000.0670.0290.0300.045
repeated_guest0.0330.0750.0170.1610.1070.1640.4690.2240.0250.5310.3840.0390.1220.0671.0000.1100.0670.074
required_car_parking_space0.0070.0680.0150.0640.0860.0690.1260.0180.0320.0670.0330.0950.0580.0290.1101.0000.0450.034
room_type_reserved0.0250.0450.1130.2780.0380.0670.1650.3290.4060.0340.0380.0750.0450.0300.0670.0451.0000.146
type_of_meal_plan0.0730.0990.1960.1040.0870.1730.2290.0900.0370.0200.0150.0700.0650.0450.0740.0340.1461.000

Missing values

2025-03-20T11:04:31.774721image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
A simple visualization of nullity by column.
2025-03-20T11:04:31.860544image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

no_of_adultsno_of_childrenno_of_weekend_nightsno_of_week_nightstype_of_meal_planrequired_car_parking_spaceroom_type_reservedlead_timearrival_yeararrival_montharrival_datemarket_segment_typerepeated_guestno_of_previous_cancellationsno_of_previous_bookings_not_canceledavg_price_per_roomno_of_special_requestsbooking_status
02012Meal Plan 10Room_Type 12242017102Offline00065.000Not_Canceled
12023Not Selected0Room_Type 152018116Online000106.681Not_Canceled
21021Meal Plan 10Room_Type 112018228Online00060.000Canceled
32002Meal Plan 10Room_Type 12112018520Online000100.000Canceled
42011Not Selected0Room_Type 1482018411Online00094.500Canceled
52002Meal Plan 20Room_Type 13462018913Online000115.001Canceled
62013Meal Plan 10Room_Type 13420171015Online000107.551Not_Canceled
72013Meal Plan 10Room_Type 48320181226Online000105.611Not_Canceled
83004Meal Plan 10Room_Type 1121201876Offline00096.901Not_Canceled
92005Meal Plan 10Room_Type 44420181018Online000133.443Not_Canceled
no_of_adultsno_of_childrenno_of_weekend_nightsno_of_week_nightstype_of_meal_planrequired_car_parking_spaceroom_type_reservedlead_timearrival_yeararrival_montharrival_datemarket_segment_typerepeated_guestno_of_previous_cancellationsno_of_previous_bookings_not_canceledavg_price_per_roomno_of_special_requestsbooking_status
362652013Meal Plan 10Room_Type 1152018530Online000100.730Not_Canceled
362662022Meal Plan 10Room_Type 28201834Online00085.961Canceled
362672010Not Selected0Room_Type 1492018711Online00093.150Canceled
362681003Meal Plan 10Room_Type 11662018111Offline000110.000Canceled
362692201Meal Plan 10Room_Type 602018106Online000216.000Canceled
362703026Meal Plan 10Room_Type 485201883Online000167.801Not_Canceled
362712013Meal Plan 10Room_Type 122820181017Online00090.952Canceled
362722026Meal Plan 10Room_Type 1148201871Online00098.392Not_Canceled
362732003Not Selected0Room_Type 1632018421Online00094.500Canceled
362742012Meal Plan 10Room_Type 120720181230Offline000161.670Not_Canceled

Duplicate rows

Most frequently occurring

no_of_adultsno_of_childrenno_of_weekend_nightsno_of_week_nightstype_of_meal_planrequired_car_parking_spaceroom_type_reservedlead_timearrival_yeararrival_montharrival_datemarket_segment_typerepeated_guestno_of_previous_cancellationsno_of_previous_bookings_not_canceledavg_price_per_roomno_of_special_requestsbooking_status# duplicates
3371002Meal Plan 10Room_Type 11922018624Offline00095.00Not_Canceled91
4191003Meal Plan 10Room_Type 1712018614Offline000120.00Not_Canceled89
3291002Meal Plan 10Room_Type 11642017102Offline000100.00Not_Canceled87
13602003Meal Plan 10Room_Type 13720181013Offline000105.00Not_Canceled84
3351002Meal Plan 10Room_Type 11882018615Offline000130.00Canceled83
12302002Meal Plan 20Room_Type 1392017814Offline000101.50Not_Canceled71
20662012Meal Plan 10Room_Type 13052018114Offline00089.00Canceled71
14842003Meal Plan 10Room_Type 13042018113Offline00089.00Canceled68
4361003Meal Plan 10Room_Type 11662018111Offline000110.00Canceled66
8862001Meal Plan 10Room_Type 156201868Offline000120.00Not_Canceled60